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1.  Introduction 

Consider  a  region  S  that  contains  M  mines.  Suppose  that  an  action  is  taken  to  remove 
the  mines  from  S,  that  Y  mines  are  removed,  and  that  a  query  is  made  about  the  number  of 
mines  XsM-Y  that  remain  in  5.  If  Af  is  known,  then,  since  Y is  also  known,  so  is  X.  In 
the  common  case  where  M  is  unknown,  however,  there  is  bound  to  be  some  uncertainty 
about  the  number  of  mines  that  remain  after  the  clearance  operation,  and  therefore  about 
whether  S  has  been  sufficiently  “sanitized”.  A  quantitative  analysis  of  the  situation  will 
require  a  probability  distribution  for  X,  which  will  in  turn  depend  on  the  prior  probability 
distribution  for  Af.  If  the  distribution  of  Af  is  of  a  particular  type,  the  Katz  type  that  is 
shortly  to  be  described,  then  the  distribution  of  X  will  be  of  the  same  type,  a  simplifying 
feature.  Katz  distributions  also  have  some  other  appealing  properties,  so  there  are  reasons 
to  begin  a  minefield  clearance  analysis  by  assuming  a  Katz  distribution  for  Af.  The  purpose 
of  this  paper  is  to  summarize  some  of  these  properties. 

The  clearance  action  is  assumed  to  be  nonexhaustive,  by  which  is  meant  that  the 
operation  cannot  guarantee  to  remove  every  mine  (if  it  were  exhaustive,  then  of  course  X 
would  be  0  regardless  of  Af).  Instead,  the  clearance  action  will  be  assumed  to  remove 


every  mine  independently  with  known  probability  p,  with  0  <  /?  <  1.  the  usual  assumption 
in  studying  minefield  clearance.  In  general  p  depends  on  the  amount  of  clearance  effort, 
sweep  widths,  and  perhaps  other  parameters,  but  the  nature  of  that  dependence  is  not  of 
concern  here.  Parameter  p  will  be  referred  to  as  the  “level”  of  clearance. 

Although  minefield  clearance  is  the  motivating  application,  of  course  the  “mines” 
could  actually  be  ore  pockets,  oil  strikes,  unexploded  ordnance,  piles  of  doggie-doo 
(domestic  minefield),  or  whatever.  The  basic  idea  is  that  an  imperfect  attempt  is  made  to 
find  a  scattered  collection  of  objects. 

Section  2  defines  Katz  distributions  and  summarizes  what  is  already  known  about 
them.  Other  properties  of  Katz  distributions  are  derived  in  Sections  3  and  4,  particularly  in 
Theorem  1  where  it  is  shown  that  X  still  has  a  Katz  distribution  after  Y  is  observed. 
Although  Sections  2-4  will  continue  to  refer  to  the  objects  being  studied  as  “mines”,  the 
material  in  those  sections  is  really  just  a  collection  of  abstract  but  possibly  useful  facts 
about  Katz  distributions.  The  implications  for  models  of  minefield  clearance  are  addressed 
in  Section  5. 

2.  Katz  Distributions  (definition) 

Katz  (1965)  describes  a  probability  distribution  jui),  xi, ...  with  the  property  that 

The  distribution  (1)  will  be  referred  to  as  a  “Katz  distribution  with  parameters  a  and  p\ 
provided  a  and  meet  certain  requirements  that  will  be  specified  in  the  next  paragraph. 
Given  equation  (1)  determines xi,  x^, ... .  Since  the  sum  +xi  +  ...  must  be  1,  x^  is 
determined  implicitly. 

The  parameter  a  must  be  normegative,  since  it  is  the  ratio  x^/xq,  and  )3  must  be  less 
than  1  to  enforce  convergence  to  0  for  large  7.  If  )9  <  0,  then  (1)  will  eventually  produce 
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negative  probabilities  unless  -a//3  is  an  integer.  To  prevent  this  possibility,  -ajp  is 
required  to  be  an  integer  when  j3  is  negative.  The  restrictions  on  parameters  are  thus  that 

a>0,  )3<1,  and  -a//3  is  an  integer  when  j3  <  0.  (2) 

oo 

Let  the  generating  function  be  g(z;a, j3)  -^xjz^ .  Katz  (1965)  showed  that 

7=0 

*{r,a,ffi  =  [(I-j3)/(I-ft)r'^.  (3) 

with  (3)  being  interpreted  as  exp(-oz)  (the  limit  as  approaches  0)  if  =  0.  It  follows  that 
the  initial  probability  must  be 

=  =  (4) 

If  M  is  a  random  variable  with  a  Katz  distribution  with  parameters  a  and  (hereafter 
abbreviated  M  ~  K{ol,  j3)),  then 

E{M)- ii  =  al{\-P)  and  Var(Af)  =  (T^  =a/(l-)3)^.  (5) 

It  is  not  hard  to  establish  that  a  Katz  distribution  is 

•  if  ;3  <  0,  a  binomial  distribution  with  -q/)3  trials  and  success  probability  ;3/()3  -  1), 

•  if  )3  =  0,  a  Poisson  distribution  with  mean  a,  or 

•  if  >  0,  a  negative  binomial  distribution.  If  aJfi  is  an  integer,  this  is  the  distribution 
of  the  number  of  failures  until  the  aJp^  success  when  the  failure  probability  is  p. 
However,  the  “number  of  successes”  aJ^  can  actually  be  any  positive  real  number 
in  this  case. 

The  Katz  class  includes  no  other  distribution,  so  it  can  be  thought  of  as  the  union  of  three 
familiar  types. 

Since  the  mean  and  variance  are  more  familiar  parameters  than  a  and  )3,  the  solution  of 
(5)  for  a  and  )3  in  terms  of  fx  and  6^  may  be  useful: 

=  and  a  =  )U^/cr^.  (6) 
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Clearly  ^  ^  0  and  o2  >  o  in  (6),  but  some  nonnegative  (fi,  a)  pairs  are  impossible  because 
of  the  restriction  that  -aJ^  must  be  an  integer  when  /3  is  negative.  This  restriction  is  aoi 
imposed  by  Katz  (1965),  who  simply  zeros  all  probabilities  after  (and  including)  the  first 
that  (1)  would  make  negative.  Unfortunately,  this  tactic  falsifies  equations  (3)  -  (6).  For 
example  suppose  a  =  1  and  )3  =  -2.  Then  (1)  has  xiIxq=  1  and  X2lxi  =-1/2,  so  Katz 
would  take  xq  =  xi  =  1/2,  jc,-  =  0  for  i  >  2.  The  mean  of  this  distribution  is  =  1/2,  not  1/3 
as  would  be  obtained  by  (5).  The  fact  that  (3)  -  (6)  are  false  when  P  <  0  and  -aJ^  is  not 
an  integer  is  not  recognized  in  Katz  (1965),  nor  in  subsequent  restatements  such  as 
Johnson  and  Kotz  (1969). 

Since  )3  =  1  -  all  (jt,  o^)  pairs  where  0  <  /r  <  are  possible.  This  covers 
situations  where  there  is  great  uncertainty  about  the  number  of  mines  present 

3.  Useful  Katz  Properties 

3.1  Sample-Observe-Subtract  (SOS) 

The  main  property  that  makes  Katz  distributions  useful  in  minefield  clearance  is  that 
the  class  is  closed  under  the  kind  of  SOS  operations  described  in  the  introduction. 
Formally, 

Theorem  1:  Let  M  be  the  number  of  mines,  suppose  M  ~  K(a,  p),  let  T  be  the  number 
of  mines  removed  when  each  mine  is  removed  with  probability  p,  independently  of  the 
others,  and  let  X  =  Af  -  K  be  the  number  of  mines  remaining  (not  removed).  Then, 
conditional  on  the  event  (F=y)  being  given,  X  ~  Kia',  fi'X  where  a'  and  j3'  are  given 
by  (10)  with  5  =  1 -p. 

Proof:  Let  =  Pr(M  =  j)  and  x*j  =  Pr(X  =  j\Y  =  y);j  =  0,....  Then 
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(7) 


x*j  Pr(y  =  y)  =  Pr(F  =  V  and  X  =  /1 

=  Pr(F  =  y  ^  M  =  y  +  j) 

=  Pr(y  =  yjM  =  y + ;■)  Pr(Af  =  y + j, 

But  Pr(y  =  y\M  =  y+j)  is  the  binomial  probability  of  y  successes  in  y  +;  trials,  so,  letting 
q=\-p, 

X*  Pr(y  =  >’)  =  (^  ^  V^y+ y ;  7  =  0, . . .  (8) 

Taking  the  ratio  of  successive  terms  in  (8),  the  factor  Pr(K= y)  cancels  and 


The  first  {  }  factor  in  (9)  is  a  ratio  of  combinatorial  coefficients,  and  the  second  is  by 
assumption  Xy+j^i/Xy^j.  The  two  (y  +j  +1)  factors  in  (9)  cancel,  so  (9)  is  again  a  linear 
function  of  7*  divided  byj  +  1,  as  was  to  be  shown.  If  a  and  )9  satisfy  (2),  it  is  easy  to  check 
that  the  same  is  true  of  the  revised  parameters  a'  and  fi',  where 

a'  =  q{a  +  py)  p'  =  qp.  (10) 

This  concludes  the  proof.  □ 

Glazebrook  and  Boys  [1995]  introduce  a  larger  class  of  distributions  that  is  still  closed 
under  the  SOS  operation.  Binomial  distributions  are  generalized  to  “light  tailed” 
distributions,  negative  binomial  distributions  are  generalized  to  “heavy  tailed” 
distributions,  and  the  Poisson  distribution  continues  to  play  its  central  role.  The  Katz  class 
can  be  regarded  as  a  two-parameter  subset  with  convenient  analytic  properties. 

Theorem  1  resolves  a  certain  minefield  paradox.  Suppose  that  a  minefield  is  cleared  to 
the  .5  level,  and  that  Y  mines  are  removed  in  the  process.  One  might  argue  that  Y  mines 
must  remain,  since  only  half  have  been  removed.  But  how  can  it  be  that  the  number 
estimated  to  remain  should  increase  with  the  number  cleared,  since  clearance  is  by  its 
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nature  subtractive?  The  paradox  disappears  when  one  realizes  that  clearance  to  a  known 
level  provides  both  evidence  and  removal.  When  j3  >  0,  the  evidence  part  dominates  and 
the  estimated  number  remaining  does  indeed  increase  with  the  number  removed.  When 
<  0,  the  removal  part  dominates.  In  the  Poisson  case  /3  =  0,  the  number  removed  does 
not  affect  the  distribution  of  the  number  that  remain. 

Since  clearance  is  a  process  carried  out  in  time,  it  is  likely  that  clearance  times 
T^,  ...,Ty  will  also  be  known  when  (i'  =  y)  is  observed.  If  the  magnitudes  of  these  times 
influence  the  posterior  distribution  of  M,  then  the  clearance  times,  as  well  as  the  number  of 
mines  cleared,  should  be  accounted  for.  However,  there  is  no  effect  of  this  kind  as  long  as 
the  clearance  level  p  is  calculable,  regardless  of  the  initial  distribution  of  the  number  of 
mines.  The  proof  of  this  statement  can  be  found  in  Appendix  A. 

3.2  Simple  Sampling 

Theorem  1  governs  the  case  where  the  number  of  mines  removed  (10  is  observed. 
There  are  also  circumstances  where  Y  is  not  observed.  One  example  is  where  M  is  the 
number  of  mines  in  region  S,  but  only  some  fraction  qofS  (call  it  S'  i  is  of  concern.  If  is 
interpreted  to  be  the  probability  that  any  given  mine  will  be  in  5',  then  the  number  of 
mines  X  in  5'  is  the  number  remaining  after  sampling  M  at  the  level  q,  but  without 
observing  the  results  of  the  sample.  Theorem  2  states  that  X  is  still  Katz. 

Theorem  2:  Let  M  be  the  number  of  mines,  suppose  M  ~  K{a,  )3),  and  let  X  be  the 
number  of  mines  in  the  sample  when  each  mine  is  included  with  probability  q, 
independently  of  the  others.  Then  X  ~  K{a',  j3').  where  a'  and  are  given  by  (14)  with 

p=\-q. 
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Proof: 


Since  X  is  binomial  when  Af  is  given. 


;=0  i=0^  ^ 


=  Y^Xj{qz  +  py 

;=o 


(11) 


(12) 


=  g{qz  +  p-,a,P).  (13) 

Equation  (12)  is  obtained  from  (11)  by  combining  the  factors  q^  and  z*,  and  then 
employing  the  Binomial  Theorem.  Equation  (13)  is  obtained  from  (12)  by  recalling  the 
definition  of  the  generating  function  g( ).  After  recalling  (3)  and  rearranging  (13),  X  can  be 
shown  to  be  Katz  with  parameters 


/  05? 

a  =-^ 


l-pp 


^  l-Pp 


(14) 


If  a  and  j3  satisfy  (2),  then  so  do  a'  and  p' .  D 

Of  course,  the  number  of  mines  Y  removed  from  M  is  also  Katz,  but  with  p  and  q 
reversed  in  (14). 


3.3  Simple  Initial  Threat  (SIT) 

Uncertainty  about  the  number  of  mines  implies  uncertainty  about  whether  the 
minefield  is  safe  for  a  transitor  to  cross.  The  simplest  quantification  is  to  define  the 
parameter 

t  s  probability  that  a  given  mine  kills  the  transitor,  ( 15) 

and  then  assume  that  all  mines  act  independently.  For  example,  suppose  that  mines  are 
distributed  uniformly  and  independently  in  a  minefield  with  width  W,  that  each  mine 
actuates  with  probability  B  if  the  transitor’s  straight  line  path  takes  it  to  within  A/2  of  the 
mine,  and  that  the  transitor  will  be  killed  with  probability  D,  conditional  on  actuation. 
Then,  as  long  as  IV » A  and  the  transitor’s  path  is  near  the  center  of  the  minefield 
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(ignoring  edge  effects,  in  other  words),  the  parameter  t  is  ABDIW.  However,  t  does  not 
need  to  be  calculated  in  that  way-  the  calculation  could  involve  actuation  curves, 
navigation  errors,  and  edge  effects  as  in  Odle  (1977). 

The  transitor  is  assumed  to  encounter  the  mines  one  at  a  time.  As  long  as  the  transitor 
survives,  the  probability  that  the  next  mine  kills  it  is  by  assumption  t,  independently  of  any 
others.  The  probability  that  all  M  mines  fail  to  MU  the  ttaositor  is  therefore  ( 1  - 1)^,  and 
the  probability  that  the  first  transitor  to  enter  the  minefield  is  killed  is 

SITsl-£((l-ry^).  (16) 

If  M~K{a,  p),  equation  (16)  can  be  evaluated  by  recalling  that  E{z^)  =  giz;  oc,  p)  and 
substituting  1  -  f  for  z;  that  is,  SIT  =  1  -  g(l  -  r,  a,  j3).  If  clearance  is  carried  out  before 
the  transitor  enters  the  minefield,  then  of  course  a'  and  p'  from  (10)  or  (14)  would  be 
substituted  for  a  and  P 

3.4  Threat  to  Following  Transitors 

The  second  and  following  transitors  are  much  harder  to  deal  with  analytically  than  the 
first.  Odle  (1977)  gives  formulas  for  several  multi-transitor  measures,  but  derivation  is 
non-trivial  even  when  the  number  of  mines  is  known.  An  exception  is  the  “catastrophic 
failure”  probability  c„,  the  probability  that  none  of  n  transitors  are  sunk.  Let  be  the 
catastrophe  probability  for  a  single  mine.  Then  c„  is  simply  for  m  independent  mines. 

If  Af  ~  K{a,  p),  then  the  (average)  catastrophe  probability  is 

c„(o,j3)  =  £:{e*')  =  *(e„;a,p).  (17) 

where  g( )  is  the  generating  function  given  by  (3).  Odle  gives  the  formula  when  M  is 
Poisson,  a  special  case.  As  in  the  case  of  SIT,  the  important  thing  is  that  the  generating 
function  of  M  be  known. 
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would  be  (1  -  r)"  if  each  transitor’s  track  were  chosen  independently  of  the  others, 
but  multiple  transitors  are  usually  assumed  to  attempt  to  follow  the  same  track.  In  that 
case  the  computation  of  Q„  can  become  a  significant  task  in  itself,  particularly  if 
navigation  errors  are  involved,  but  the  degree  of  difficulty  has  nothing  to  do  with  the 
distribution  of  the  number  of  mines  present. 

There  appear  to  be  no  simple,  closed-form  formulas  other  than  (17)  when  muitiplc- 
transitors  are  present,  even  when  the  number  of  mines  is  known.  There  are  practical 
methods  for  calculating  the  casualty  distribution  and  other  statistical  measures  (Odle, 
1977),  but  the  methods  do  not  simplify  when  the  number  of  mines  has  a  Katz  (or  even  a 
Poisson)  distribution. 

A  simple  upper  bound  on  E„,  the  expected  number  of  casualties  out  of  n  transitors, 
can  be  obtained  by  observing  that  the  number  of  casualties  cannot  exceed  M,  and  therefore 
that  cannot  exceed  E(M).  If  each  mine  causes  a  casualty  with  probability  at  most  D 
whenever  it  detonates,  then  a  better  bound  is 

E„<DE(M).  (18) 

If  M  ~  K(a,  p),  then  E(M)  is  given  by  (5).  Since  £„  is  necessarily  a  nondecreasing  function 
of  n,  (10)  is  sharpest  for  large  values  of  n.  Of  course,  Ey  =  SIT. 

4.  Sums  and  Partitions 

Throughout  this  section  there  are  n  independent  mine  populations  A/,-,  with 
Ml  ~  K{ai,  p),  It  will  be  assumed  that  a,-  >  0,  since  otherwise  Mj  =  0  and 

population  i  could  be  omitted.  The  total  number  of  mines  is  A/h  Mi+...+  M^.  The 
clearance  level  for  the  population  is  p,-,  with  s  1  -  Pi-  The  number  of  type  i  mines 
cleared  is  T/,  with  ¥=¥[+...+  ¥„,  and  the  number  remaining  is  X,-,  with  X  =  Xi+...+  X^. 
Of  course  X,-  +  F,-  =  Mi  and  X  +  ¥  =  M.  These  mine  populations  might  be  different  kinds  of 
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iTiines  in  one  minefield,  the  numbers  of  mines  in  different  minefields,  or  any  other  partition 
of  M  into  n  parts. 

4.1  Sums 

If  all  of  the  mine  populations  have  Katz  distributions,  does  the  total  number  of  mines 
M  also  have  a  Katz  distribution? 


Theorem  3:  If  j9,-  =  /3  for  all  i,  then  M  ~  Kia,  ^3),  where  a  =  Oj  +...+  Otherwise,  M 
does  not  have  a  Katz  distribution. 

Proof:  Since  the  My  are  all  independent,  the  generating  function  of  M  is 

/=i 

If  =  /3  for  all  /,  then  (19)  reduces  to  g(z)  =  [(l-)3)/(l-)3z)]“^^,  the  generating  function 

of  a  Katz  random  variable.  Otherwise,  (19)  does  not  have  the  required  form  and  M  is 
therefore  not  Katz.  D 


rorotlarv  1:  If  <7yj3y  =  for  some  parameter  )3,  /  =  1, ...,  /i,  and  if  K,  is  observed  for 
/=!,...,  n,  then 


X~K 


\i=l 


Proof:  According  to  (10),  Xj K{aiqi  + pYi,qiPi)  when  Y.  is  given.  Since 

q.fi.  =  )3,  the  conclusion  that  X  has  a  Katz  distribution  then  follows  from  Theorem  3.  D 
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CoroUarv  2:  Suppose  (?,  =  (l/jS,- 1)/(1/J3- 1)  for  some  ^  Tlien 

X  -  Ki(Xn[Oj,  i3),  where 

<^TOT 

i=l 

If  instead pi  =  (l//3j  -  l)/(l/y3  -  1)  for  n,  then  F ~  K(a^Q^, 

Proof:  The  condition  on  qj  enforces  Pi  =  P  and  aj  =  in  (14),  which 

applies  when  the  number  of  mines  cleared  is  not  observed.  The  conclusion  then  follows 
from  Theorem  3.  If  the  condition  on  p,-  holds,  then  the  same  logic  implies  to  Y,  the  number 
of  mines  not  removed.  □ 

If  X  and  Y  in  corollary  2  are  both  to  have  Katz  distributions,  then  it  is  necessary  that  pi 
and  qi  both  be  proportional  to  (l/j3j  -  1).  This  is  not  possible  unless  j3,  =  0  for  all  i  (in 
which  case  )3  is  also  0  —  this  is  the  Poisson  case),  or  if  j9;  and  p,-  are  both  independent  of  i. 
Either  of  these  conditions  also  ensures  that  X  will  be  Katz  in  the  SOS  condition  where  F) 
is  observed  for  all » as  in  corollary  1. 

If  the  only  possibilities  in  clearance  are  that  the  clearance  of  each  type  is  either 
observed  or  not  observed,  then  the  number  of  mines  of  each  type  remains  Katz  after 
clearance,  and  independence  is  preserved  between  populations.  Other  kinds  of 
observations,  however,  can  destroy  this  Katz  structure.  One  example  would  be  an 
observation  that  the  total  number  of  mines  X  cannot  exceed  some  limit  because  of  logistic 
considerations.  Of  mote  concern  is  the  possibility  that  F  might  be  observed,  without 
knowing  all  of  its  components. 

To  be  precise,  suppose  that  F  is  known  without  knowing  the  identity  of  any  of  the 
mines  that  have  been  cleared.  Are  the  residual  numbers  X,  still  independent  and  Katz  under 
this  condition?  The  answer  is  yes  if  F  =  0,  since  the  observation  that  F  =  0  is  equivalent  to 
the  observation  that  F,-  =  0  for  all  /.  The  answer  is  also  yes  if  p  =  1,  since  in  that  case 
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Xi  =  0  for  all  i.  One  might  hope  that  the  answer  would  still  be  yes  even  if  /><  1  and  F  >  0, 
provided  jS,-  =  j3  for  all  i,  since  the  latter  condition  is  sufficient  for  F  to  be  Katz. 
Unfortunately,  this  is  not  true.  Appendix  B  shows  that  conditional  independence  fails 
unless  ^  =  0. 

If  =  0  for  all  I,  then  the  Xj  are  still  Poisson  and  independent  when  F  is  given  because 
the  information  in  F  is  irrelevant;  the  distribution  of  X^  has  nothing  to  do  with  the  number 
of  mines  cleared.  Thus,  it  is  only  in  the  Poisson  case  where  the  individual  populations 
remain  independent  and  Katz  when  F  alone  is  observed. 


4.2  Partitions  and  MCK  Distributions 

If  M  -  X(a,  J3),  then  a  closed  form  expression  for  the  probability  mass  function  of  Af, 
valid  ifj3<0or^>0,  is 


P(M  =  m)  =  ^  ;„^o.  (21) 

m! 

The  notation  (x)^  is  taken  from  Feller  (1957)  where  {x)^  is  defined  tobex(Jc-l)  -- 
(x  -  m  +  1)  for  m  ^  I,  with  (jc)o  a  1  (m  is  a  nonnegative  integer,  but  x  can  be  any  real 
number).  The  limit  as  produces  a  Poisson  distribution,  so  in  that  sense  (21)  is  valid 
for  all  (a,  p)  satisfying  (2).  If  Mj  ~  X(a,-,  p),  and  if  Mi, ...,  Af„  are  all  independent,  then 
M-K{a,p)  according  to  Theorem  3.  Let  M ^  (^i, . . .,  M„),  and m s  (mi,  Then 

nP(M,  =  m,) 

P(M  =  m|M  =  m)= - ; — ;  m;  20,m  =  mi+.. .+»■„.  (22) 

'  P{M  =  m) 

All  of  the  factors  involving  (1  -  j3)  and  (-p)  raised  to  powers  cancel  in  (22),  leaving 


y  (m,)! 

P(M  =  m|Af  =  m)  = - ;  m,- ^0,m  =  mi+...+m„. 

\^/  P)fn 

(m)! 


(23) 
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The  distribution  (23)  will  be  referred  to  as  a  “multivariate  conditional  Katz  distribution 
with  parameters  a,  j3,  and  m”,  or  MCK  for  short.  The  MCK  distribution  is  a  multivariate 
hypergeometric  distribution  when  )3  <  0,  or  a  multinomial  distribution  in  the  limit  as  0 
(Johnson  and  Kotz,  1969).  When  j3  >  0,  the  MCK  distribution  has  been  called  a 
multivariate  Polya-Eggenberger  distribution  (Johnson  and  Kotz,  1977)  on  account  of  its 
relationship  to  certain  um-sampling  schemes,  or  simply  the  multivariate  Polya  distribution 
(Janardin  and  Patil,  1970).  Thinking  of  Af,-  as  the  number  of  balls  in  an  urn  leads  to  a 
practical  way  of  generating  M  in  a  Monte  Carlo  simulation,  since  only  a  single  Katz 
sample  of  the  total  M  is  really  required.  This  is  the  gist  of  Theorem  4. 


Theorem 4:  Let  M ~ K(a, p),  where  a=  ai  +...+  a„,  a,- ^0  for  l^i<n.  The  pair 
(a,-,  p)  is  assumed  to  satisfy  (2)  for  1  ^  i  ^  n.  Consider  the  following  procedure  for  placing 
MbaUs  in  n  urns.  For  it  =  0, ...,  Af-  1,  the  k  +  l^t  ball  is  placed  in  um  i  with  probability  pi, 
where 


*  a + pk  ' 


(24) 


and  where  it,  is  the  number  of  balls  already  in  um  i.  If  Af^  is  the  number  of  balls  finally 
placed  in  um  i,  then  Af,-  ~  K(aj,  p),  and  aU  of  the  Af^  are  independent  of  each  other, 
I  =  1, ...,  n.  Q 


Proof:  Let  M  =  (Afi, . . . ,  and  m  s  (mi, . . . ,  m„).  It  will  be  shown  by  induction 

that  P(M  =  m|Af  =  m)  is  given  by  (23)  for  m  >  0.  Since  (23)  is  equivalent  to  (22),  the 

theorem  follows  upon  removing  the  condition  on  Af. 

Let  Q(m)  be  P(M  =  ra|Af  =  mi+...+m„),  and  note  that  Q(Q)  =  1,  a  special  case  of 

(23)  where  mi  +...+  m„  =  0.  Suppose  Q(m)  is  given  by  (23)  for  all  m  such  that  mi  +... 
+  mfi  =  k;  let  «,•  be  an  n-vector  aU  of  whose  components  are  zero  except  for  component  i, 
which  is  1;  and  let  ife,  =  m,-  -  1,  i  =  1, . . .,  n  (if  fc/  <  0,  then  the  corresponding  term  may  be 
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omitted  from  (25)  below).  Then,  conditioning  on  the  ball  configuration  after  k  balls  have 
been  placed, 

(25) 

n 

where  ra  is  now  any  configuration  such  that  =  k  +  l.  Q(m -  e,)  on  the  light  hand 

i=l 

side  of  (25)  is  by  assumption  given  by  (23),  and  it  is  now  only  a  matter  of  some  algebra  to 
conclude  that  Q(fn)  on  the  left  hand  side  is  also  given  by  (23).  Since  la  is  arbitrary  except 
for  its  sum,  this  completes  the  inductive  proof.  CD 

Comment:  When  j3  =  0,  (24)  makes  Pi  =  a, /a  for  every  ball.  The  fact  that  a  Poisson 
random  variable  produces  independent  Poisson  parts  when  partitioned  in  this  manner  is 
well  known  (e.g.  Ross  (1993)).  When  13  96  0,  if  each  ball  is  placed  in  urn  i  with  probability 
a/a,  instead  of  according  to  (24),  then  by  Theorem  2  where 

a<=aj/(l-)3(l-a//a))  and  =)3(aj/a)/(l-jS(l-af/a)).  £(Mj)  is  still 
but  it  is  not  true  that  and  furthermore  Mi,  ...,M„  are  not  mutually 

independent.  These  latter  properties  require  that  the  balls  be  allocated  according  to  (24). 

5.  Tactical  Decision  Aids  (TDA’s)  for  Minefield  Clearance 

Barring  the  possibility  of  exhaustive  search,  any  mine  clearance  campaign  has  got  to 
cope  with  the  problem  of  deciding  when  to  stop.  Stopping  after  a  fixed  time  is  of  course 
an  option,  but  it  makes  sense  to  let  the  stopping  time  depend  on  results  achieved  to  date, 
particularly  when  there  is  as  much  initial  uncertainty  as  is  usually  the  case  in  mine 
clearance.  As  a  minimum,  therefore,  any  decision  aid  for  mine  clearance  should  be  able  to 
display  the  “status”  of  a  clearance  effort  in  terms  that  support  the  stopping  decision,  or, 
more  generally,  decisions  about  what  should  be  done  next.  The  natural  status  of  a 
minefield  is  the  risk  that  it  poses  to  the  traffic  against  which  it  was  designed.  In  simplest 
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terras  this  risk  is  raeasured  by  SIT.  SIT  depends  strongly  on  ne  number  of  mines 
remaining,  so  it  is  hard  to  resist  the  conclusion  that  the  number  of  mines  initially  present 
must  be  an  input,  even  if  the  number  is  so  vaguely  known  that  the  input  must  be  a 
probability  distribution.  Without  some  input  or  assumption  about  the  number  of  mines 
initially  present,  it  is  hard  to  imagine  how  a  basically  subtractive  clearance  activity  could 
result  in  sufficient  knowledge  about  the  number  of  mines  remaining  to  support  a 
computation  of  SIT. 

In  spite  of  the  above  considerations,  current  (1996)  mine  clearance  TDA’s  typically  do 
not  require  the  number  of  mines  present  to  be  an  input.  There  are  a  variety  of  reasons  for 
this,  but  the  only  important  point  is  that  the  reader  should  understand  that  the  necessity  for 
a  distribution  for  the  number  of  mines  to  be  an  explicit  input  is  arguable. 

Even  if  one  accepts  the  idea  that  the  number  of  mines  M  must  be  thought  of  as  a 
random  variable,  it  does  not  necessarily  follow  that  M  should  have  a  Katz  distribution, 
since  Bayes  Theorem  could  just  as  well  be  applied  to  a  general  distribution.  A  general 
distribution  would  require  storing  1000  numbers  if  the  maximum  conceivable  number  of 
mines  were  999.  A  Katz  distribution  requires  only  2,  but  performing  a  Bayesian  update  on 
a  general  distribution  is  trivial  with  a  modem  computer;  in  a  different  context, 
NODESTAR  (Stone  and  Corwin,  1995)  performs  such  updates  with  10^  states,  rather 
than  only  10^.  Using  a  general  distribution  would  also  have  the  advantage  that  any 
observation  with  a  known  concUtional  probability  law  could  be  the  basis  of  a  Bayesian 
update,  which  is  not  true  in  the  Katz  case.  The  idea  of  using  general  distributions  does  not 
become  computationally  unwieldy  until  multiple  random  variables  must  be  described 
jointly.  If  there  were  for  example  3  mine  types,  the  number  of  each  of  which  does  not 
exceed  999,  then  there  would  be  10^  joint  possibilities.  Today’s  computers  cannot  perform 
Bayesian  updates  on  that  scale.  The  Katz  assumption  provides  no  relief  from  this  kind  of 
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explosion,  since  there  is  no  useful  theory  for  multiple  Katz  randorr  variables  unless  they 
are  all  independent 

On  the  other  hand,  it  is  also  true  that  very  little  is  lost  by  restricting  input  distributions 
to  be  of  the  Katz  type.  The  two  Katz  parameters  are  sufficient  for  quantifying  the  center 
and  spread  of  a  distribution,  and  it  is  hard  to  imagine  knowing  enough  about  the  number 
of  mines  to  need  more  detail.  In  fact,  the  Katz  restriction  may  be  operationally  welcome, 
since  the  entire  distribution  is  determined  from  only  two  estimated  numbers.  With  these 
thoughts  in  mind,  one  TDA  (MIXER)  proposed  by  the  author  (Washburn,  1995)  employs 
Katz  distributions  exclusively,  requiring  the  user  to  quantify  uncertainty  by  providing  a 
mean  and  standard  deviation  for  each  mine  type. 

Katz  distributions  also  have  some  computational  advantages  compared  to  the  general 
case.  At  any  point  in  a  clearance  operation  where  the  clearance  leve^  and  the  number(s)  of 
mines  cleared  are  known,  it  is  easy  to  compute  revised  Katz  parameters  (formula  (10))  and 
then  SIT  (formula  (16))  while  the  comparable  operations  in  the  gene^  case  would  require 
extensive  computation.  The  Katz  divisibility  properties  described  in  Theorem  4  could  also 
prove  handy.  If  a  region  containing  M  mines  must  be  divided  into  two  parts,  then  Theorem 
4  describes  how  the  number  of  mines  in  the  two  parts  can  be  independent,  Katz,  and  still 
sum  to  M.  The  comparable  operation  in  the  general  case  may  be  difficult  or  impossible. 
Perhaps  most  important,  the  availability  of  an  analytic  expression  for  SIT  opens  up  the 
possibility  (as  in  MIXER)  of  posing  the  mathematical  problem  of  minimizing  SIT  subject 
to  constraints  on  the  clearance  effort,  a  computational  problem  that  would  be  much  more 
difficult  in  the  general  case. 

Appendix  B  shows  that  one  seemingly  irmocent  observation  (total  mines  cleared)  can 
create  analytical  havoc  when  multiple  Katz  populations  are  present  -  the  residual  mine 
populations  are  not  independent.  Letting  the  populations  have  general  distributions  would 
not  relieve  this  problem,  since  the  general  case  includes  the  K^  case.  Rigorously 
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processing  measurements  of  this  sort  will  require  Bayesian  updates  of  joint  distributions, 
regardless  of  the  nature  of  the  marginal  distributions.  The  point,  again,  is  that  there  is  little 
to  be  gained  by  permitting  marginal  distributions  to  be  general,  rather  than  Katz. 

The  difficulty  described  in  the  paragraph  above  could  be  relieved  by  forcing  all 
distributions  to  be  Poisson,  as  noted  on  page  11.  Poisson  distributions  also  have  other 
important  analytic  advantages,  but  unfortunately  have  only  a  simple  parameter.  For 
example  a  Poisson  distribution  with  mean  100  necessarily  has  a  standard  deviation  of 
(only)  10.  Inclusion  of  distributions  with  )3  >  0  (negative  binomial  distributions)  in  the 
permitted  class  seems  essential  to  model  the  large  uncertainty  about  mine  numbers  that  is 
to  be  expected. 

In  summary,  the  Katz  class  of  distributions  is  large  enough  to  support  mine  clearance 
TDA’s,  and  offers  several  convenient  analytic  properties.  The  Poisson  class  would  be  even 
more  convenient,  but  is  not  large  enough.  Permitting  general  distributions  would  lose  the 
convenient  Katz  properties,  and  therefore  should  be  done  only  if  there  is  some  use  for  the 
added  flexibility. 

In  minefields  where  multiple  mine  types  are  present,  all  theorems  proved  above  require 
an  independence  assumption.  If  observations  that  would  destroy  independence  are 
contemplated,  then,  in  spite  of  the  implied  computational  burden,  a  Bayesian  TDA  will 
have  to  be  based  on  general,  multivariate  distributions.  In  other  words,  Katz  distributions 
appear  to  be  the  right  answer  only  as  long  as  observations  preserve  independence  between 
populations. 
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APPENDIX  A 

Clearance  times  have  no  additional  value 

When  the  number  of  mines  cleared  (F)  is  observed,  it  is  likely  ti*at  the  clearance  times 
Uy,  ...,Uy  will  also  be  observed.  These  times  turn  out  to  have  no  additional  value  in 
making  inferences  about  the  initial  number  of  mines  M,  whether  or  not  M  has  a  Katz 
distribution,  and  therefore  no  value  for  the  residual  number  of  mines  M-Y.  This  result 
may  seem  counterintuitive.  If  one  searches  for  24  hours,  finding  5  mines  in  the  first  hour 
and  none  thereafter,  then  intuition  argues  that  there  are  probably  no  remaining  mines, 
whereas  there  might  be  more  mines  if  the  clearance  times  were  scattered  over  the  whole 
clearance  period.  This  intuition  would  be  correct  if  the  probability  law  F( )  governing  the 
detection  times  were  unknown,  since  there  is  information  about  F( )  in  the  clearance  times. 
If  F( )  is  known,  however  (as  it  must  be  if  the  clearance  level  is  calculable),  then  the 

corollary  below  states  that  the  clearance  times  are  useless. 

> 

Theorem;  Let  Af  be  a  nonnegative  random  variable,  and  let  Tj, . . . ,  be  independent 
random  variables  with  common  distribution  function  F( ).  Let  r  be  a  iy  real  number,  let  /j 
indicate  the  event  let  Y=  ly +  ...■¥ I and  let  U=  (£/i, ...,  C/y),  where 

U\,  ...,Uy  are  the  nondecreasing  order  statistics  of  those  for  which  T,-  ^  r.  If  m  and  y 
are  nonnegative  integers  for  which  0  <  y  <  m,  and  if  u  =  (uy, . . .,  is  a  real  vector,  then 
either  Pr(y  =  y,  M  =  m)  =  0,  or 

Pr(U  =  u|y  =  y,  Af  =  m)  =  Pr(U  =  iijy  =  y).  (Al) 

Proof:  Both  sides  of  (Al)  are  well  defined  if  Pr(y  =  y,  Af  =  m)  >  0.  Furthermore, 

both  are  0  unless  uy  <  t  for  i  =  1, ...,  y  and  «,  <  uy+y  for  i  =  1, ...,  y  -  1,  so  suppose  that 
those  conditions  hold.  Define  the  event 

y  m 

m)n(rj  =  »,  )  ncj;  >  ')•  (A2) 

/=!  i=y+l 


18 


Then 


Pr(^)7n )  =  Pr(A/  =  "»)  fl  ) 


i=l 


[I -''(»)] 


m-  ^ 


(A3) 


The  event  (U  =  u)n(Af  =  m)  includes  and  other  mutually  exclusive  events  that  have 

the  same  probability,  since  the  first  y  of  the  are  not  necessarily  the  smallest  If  all 
components  of  u  are  unequal,  the  number  of  these  events  is  number  of 

permutations  of  m  things  taken  y  at  a  time.  More  generally,  let  there  be  K  distinct 

components  in  u,  with  being  the  number  of  times  the  ^  is  repeated.  Then  there  are 
events,  where  j  is  the  multinonual  coefficient  for  y  things  taken  nj, 

ni,  at  a  time.  Every  subset  of  {r^, ...,  of  size  y  can  be  assigned  to  the 

components  of  u  in  j  different  ways.  Thus 

Pr(U  =  u,  M  =  m)  =  Pr(£^ ).  (A4) 

Since  Pr(l’  =  }’,M  =  m)  =  Pr(M  =  m)(“V(l)>’[l-f(()]"“^,  it  U  a  simple  matter  to  take 
the  ratio  Pr(U  =  u,  M  =  m)/Pr(F = y,  M  =  m)  to  obtain 

y 

Pr(U  =  ulT  =  y,M  =  m)  =  )/ ^(O]-  (A5) 

But  the  right  hand  side  of  (A5)  does  not  depend  on  m,  so  it  must  also  be  Pr(U  =  u|y  =  y). 
Conditional  on  (F=y)  being  given,  the  order  statistics  U  are  distributed  as  if  U  were  the 
order  statistics  of  the  truncated  distribution  F(  )/F{t),  sampled  y  times.  Cl 

CoroUarv:  Either  Pr(U  =  u,T  =  y)  =  0  or  Pr(Af  =  m|U  =  u, F  =  y)  =  Pr(M  =  m|y  =  y). 


Proof!  Let  A  =  (U  =  u),  B  =  (F= y),  C  =  (Af  =  m),  and  assume  Pr(A  n  B)  >  0.  It 

foUows  that  Pr(ClAnB)  and  Pr(ClB)  are  well  defined,  and  both  are  zero  if 
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Pr(B  n  C)  =  0.  If  Pr(5  n  O  >  0,  then  Pr(>i|B  n  C)  =  Pr(A|B)  >  0  by  the  theorem,  and  all 
that  remains  is  to  write  the  definition  of  Pr(C|AnB),  cancel  equal  factors,  and  observe 
that  the  result  is  Pr(ClB).  D 
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APPENDIX  B 

A  counterexample  to  conditional  independence 

Suppose  Ml  ~  K{a,  p),  i  =1,2,  and  let  M  =  Xj  +  X2  and  a  =  ai  +  02-  Each  of  the  M 
mines  is  cleared  with  probability  p<  I,  independent  of  the  others.  Let  Yi  be  the  number  of 
mines  of  type  i  cleared,  let  F  =  Fj  +  F2,  let  X,-  =  M,-  -  F,-  be  the  number  of  type  i  mines 
remaining  after  clearance,  and  let  ^  =  1  -  p.  It  will  be  shown  that  X^  and  X2  are  not 
independent  under  the  condition  F=  1,  unless  15=  0.  It  suffices  to  show  the  same  thing 
about  the  events  (Xj  =  0)  and  (X2  =  0). 

Let  £  be  the  event  (Xj  =  0  and  X2  =  0  and  F  =  I),  and  let  Fj  be  the  event  (Xj  =  0  and 
Y=  1).  F  is  the  same  as  the  event  that  exactiy  one  mine  is  present  and  that  it  is  cleared,  so, 
using  (21)  and  the  fact  that  M  -  K{a,  p)  by  Theorem  2, 


P{E)  =  pP{M  =  1)  =  pa(l  -  pf^^ .  (B 1) 

Fi  is  the  union  of  two  mutually  exclusive  events 

G  =  (Xi  =  1  and  Fi  =  1  and  F2  =  0)  and 
H  =  (Xi  =  0andX2^  1  andF2  =  1). 

F(G)  is  just  pP{Xi  =  1)  F(F2  =  0),  but  evaluating  P(H)  requires  a  summation: 


PiH)  =  F(Xi  =  0)X  P(X2  =  jljpqj-^l 
y=i 


(B2) 


with  the  factor  in  []  being  a  binomial  probability.  The  sum  can  be  evaluated  by 

^V)8 


(  l-B 

differentiating  the  generating  function  of  X2.  Letting  y  =  - — — 

yl-pq 


,  the  result  is  that 


P{H)  =  pP{Xi  =  0) 


«2 

l-jS? 


.aj 


(B3) 


Since  F(F2  =  0)  =  and  F(Xi  =  1)  =  aiFCXj  =  0),  P(Fi)  can  be  obtained  by  summing 
P(G)  +  P{H): 


P(Fl)  =  pP(Xi  =  0)r“Vi  =  (B4) 
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where /i  =  a^  +  a^{\  -  I5q).  Defining  F2  similarly  to  Fj,  it  follows  by  symmetry  tiiat 

(B5) 

where /2  =  0^2  +  Oj/(l  -  I5q). 

If  the  events  (X^  =  0)  and  (X2  =  0)  are  to  be  independent  when  the  event  (f  =  1)  is 

given,  it  should  be  true  that  P(Fi)P(F2)  =  P{E)P{Y  =  1),  so  consider  the  ratio  R  = 

F(Fi)F(F2)/(F(F)F(F=  1)).  Since  F(F  =  l)  =  -^^7“,  all  factors  are  known  and  R  can 

l-Pq 

be  computed.  Most  factors  cancel,  leaving  only 

R  =  {l-pq)Af2/cc^.  (B6) 

It  can  be  shown  using  simple  algebra  that  (1  -  j39)/i/2- ^  ^(2^0,  with 

equality  possible  only  if  /3  =  0.  Thus  (X^  =  0)  and  (X2  =  0)  are  not  conditionally 
independent.  A  slightly  stronger  statement  is  possible:  the  probability  of  having  0  mines 
remaining,  given  F=  1,  is  larger  than  the  prediction  based  on  conditional  independence. 


22 


REFERENCES 


Feller,  W.  1957.  An  Introduction  to  Probability  Theory  and  its  Applications  (2nd  ed.), 
Wiley,  p.  48. 

Glazebrook,  K.D  and  Boys,  R.J.  1995.  “A  Class  of  Bayesian  Models  for  Optimal 
Exploration,” of  the  Royal  Statistical  Society  (B),  57,  No.4,  pp.  705-720. 

Janardin,  K.  and  Patil,  G.  1970.  “On  the  Multivariate  Polya  Distribution:  A  Model  of 
Contagion  for  Data  wid»  Multiple  Counts.”  Random  Counts  in  Scientific  Work  IL 
Random  Counts  in  Physic^  Science,  Geo-Science,  and  Business  (ed.  G.P.  Patil).  The 
Pennsylvania  State  University  Press,  pp.  143-162. 

Johnson,  N.  and  Kotz,  S.  1969.  Discrete  Distributions,  Wiley,  NY,  p.  36-43, 

Johnson,  N.  and  Kotz,  S.  1977.  Urn  Models  and  Their  Applications,  Wiley,  NY,  p.  194. 

Katz,  L.  1965.  “Unified  Treatment  of  a  Broad  Class  of  Discrete  Probability  Distributions,” 
in  Classical  and  Contagious  Discrete  Distributions  (G.  Patil,  ed.).  Statistical 
Publishing  Society,  Pergamon  Press,  Oxford. 

Odle,  J.  1977.  Minefield  Analysis  for  Channelized  Traffic,  NSWC/WOL  TR  77-109, 
Naval  Surface  Weapons  Center,  White  Oak,  MD. 

Ross,  S.  1993.  Probability  Models  (5th  ed.)  Academic  Press,  NY,  pp.  216-217. 

Stone,  L.  and  Corwin,  T.  1995.  “NODESTAR:  A  Nonlinear,  Discrete,  Multiple-target, 
Correlator-tracker:  Part  1,”  U.S.  Navy  Journal  of  Underwater  Acoustics,  45,  No.  3, 

pp.  1-16. 

Washburn,  A.  1995.  “MIXER:  A  TDA  for  Mixed  Minefield  Clearance,”  NPSOR-95- 
OllPR,  Naval  Postgraduate  School,  Monterey,  CA  93943. 


23 


DISTRIBUTION  LIST 


1.  Research  Office  (Code  08) . 

Naval  Postgraduate  School 
Monterey,  CA  93943-5000 

2.  Dudley  Knox  Library  (Code  52) . 

Naval  Postgraduate  School 
Monterey,  CA  93943-5002 

3.  Defense  Technical  Information  Center 
8725  John }.  Kingman  Rd.,  STE  0944 
Ft.  Belvoir,  VA  22060-6218 

4.  Department  of  Operations  Research  .. 
Editorial  Assistant  (Code  OR/  Bi) 

Naval  Postgraduate  School 
Monterey,  CA  93943-5000 

5.  Prof.  Alan  R.  Washburn  (Code  OR/Ws) 
Naval  Postgraduate  School 
Monterey,  CA  93943-5000 

6.  COMINEWARCOM  (N02R) . 

325  Fifth  St  SE 

Corpus  Christi,  TX  78419 

7.  Coastal  Systems  Station  . . 

6703  W.  Highway  98 

Panama  City,  FL  32407-7001 

8.  Center  for  Naval  Analyses . 

4401  Ford  Avenue 

P.O.  Box  16268 
Alexandria,  V A  22302-0268 

9.  Office  of  Naval  Research  (322TE) . 

800  North  Quincy  Street 
Arlington,  VA  22217-5660 

10.  Daniel  H.  Wagner,  Assoc . 

2  Eaton  St.,  Suite  500 

Hampton,  VA  23669 


